© 



Europaisches Patentamt 
European Patent Office 
Office europeen des brevets 





© Publication number: 0 399 666 B1 



© 



EUROPEAN PATENT SPECIFICATION 



@ Date of publication of patent specification 
28.07.93 Bulletin 93/30 



© Int CI. 6 : C12N 15/62, C07K 13/00, 
C12P 21/02 



© Application number : 90304575.5 



Date of filing : 26.04.90 



© Fusion proteins containing N-terminal fragments of human serum albumin. 







Consolidated with 90907285.2/0470165 
(European application NoJpublication No.) by 
decision dated 20.07.92. 




References cited : 
EP-A- 0 308 381 
EP-A- 0 322 094 




® 


Priority: 29.04.89 GB 8909919 


© 


Proprietor: Delta Biotechnology Limited 
Castle Court, Castle Boulevard 
Nottingham NG7 1FD (GB) 




© 
® 


Date of publication of application : 
28.11.90 Bulletin 90/48 

Publication of the grant of the patent : 
28.07.93 Bulletin 93/30 




Inventor : Ballance, David James 
11 South Road 

West Bridgford, Nottingham NG2 7AG (GB) 




® 


Designated Contracting States : 

AT BE CH DE DK ES FR 6R IT U LU NL SE 


© 


Representative : Bassett, Richard Simon et al 
ERIC POTTER & CLARKSON St Mary's Court 
St. Mary's Gate 
Nottingham NG1 1LE (GB) 


CD 










to 

ID 










o 











Note : Within nine months from the publication of the mention of the grant of the European patent, any 
O person may give notice to the European Patent Office of opposition to the European patent granted. 
q_ Notice of opposition shall be filed in a written reasoned statement. It shall not be deemed to have been 
jjj filed until the opposition fee has been paid (Art. 99(1) European patent convention). 



Jouve, 18, rue Saint-Denis, 75001 PARIS 
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Description 

The present invention relates to fusion polypeptides where two individual polypeptides or parts thereof are 
fused to form a single amino acid chain. Such fusion may arise from the expression of a single continuous cod- 

5 ing sequence formed by recombinant DNA techniques. 

Fusion polypeptides are known, for example those where a polypeptide which is the ultimately desired 
product of the process is expressed with an N-terminal leader sequence" which encourages or allows secre- 
tion of the polypeptide from the cell. An example is disclosed in EP-A-116 201 (Chiron). 

Human serum albumin (HSA) is a known protein found in the blood. EP-A-147 198 (Delta Biotechnology) 

10 discloses its expression in a transformed host, in this case yeast. Our earlier application EP-A-322 094 dis- 
closes N-terminal fragments of HSA, namely those consisting of residues 1-n where n is 369 to 419, which 
have therapeutic utility. The application also mentions the possibility of fusing the C-terminal residue of such 
molecules to other, unnamed, polypeptides. 

One aspect of the present invention provides a fusion polypeptide comprising, as at least part of the N- 

15 terminal portion thereof, an N-terminal portion of HSA or a variant thereof and, as at least part of the C-terminal 
portion thereof, another polypeptide except that, when the said N-terminal portion of HSA is the 1-n portion 
where n is 369 to 419 or a variant thereof then the said polypeptide is (a) the 585 to 1578 portion of human 
f ibronectin or a variant thereof, (b) the 1 to 368 portion of CD4 or a variant thereof, (c) platelet derived growth 
factor, or a variant thereof, (d) transforming growth factor, or a variant thereof, (e) the 1-261 portion of mature 

20 human plasma f ibronectin or a variant thereof, (f) the 278-578 portion of mature human plasma f ibronectin or 
a variant thereof, (g) the 1-272 portion of mature human von Willebrand's Factor or a variant thereof, or (h) 
alpha-1-antitrypsin or a variant thereof. 

The N-terminal portion of HSA is preferably the said 1-n portion, the 1-177 portion (up to and including 
the cysteine), the 1-200 portion (up to but excluding the cysteine) or a portion intermediate 1-177 and 1-200. 

25 The term "human serum albumin" (HSA) is intended to include (but not necessarily to be restricted to) 

known or yet-to-be-discovered polymorphic forms of HSA. For example, albumin Naskapi has Lys-372 in place 
of Glu-372 and pro-albumin Christchurch has an altered pro-sequence. The term "variants" is intended to in- 
clude (but not necessarily to be restricted to) minor artificial variations in sequence (such as molecules lacking 
one or a few residues, having conservative substitutions or minor insertions of residues, or having minor va- 

30 nations of amino acid structure). Thus polypeptides which have 80%, preferably 85%, 90%, 95% or 99%, hom- 
ology with HSAare deemed to be "variants". It is also preferred for such variants to be physiologically equivalent 
to HSA; that is to say, variants preferably share at least one pharmacological utility with HSA. Furthermore, 
any putative variant which is to be used pharmacologically should be non-immunogenic in the animal (espe- 
cially human) being treated. 

35 Conservative su bstitutions are t hose where one or more amino acids are substituted for ot hers having sim- 

ilar properties such that one skilled in the art of polypeptide chemistry would expect at least the secondary 
structure, and preferably the tertiary structure, of the polypeptide to be substantially unchanged. For example, 
typical such substitutions include asparagine for glutamine, serine for asparagine and arginine for lysine. Va- 
riants may alternatively, or as well, lack up to ten (preferably only one or two) intermediate amino acid residues 

40 (ie not at the termini of the said N-terminal portion of HSA) in comparison with the corresponding portion of 
natural HSA; preferably any such omissions occur in the 100 to 369 portion of the molecule (relative to mature 
HSA itself) (if present). SimParly, up to ten, but preferably only one or two, amino acids may be added, again 
in the 100 to 369 portion for preference (if present). The term "physiologically functional equivalents" also en- 
compasses larger molecules comprising the said sequence plus a further sequence at the N-terminal (for ex- 

45 ample, pro-HSA, pre-pro-HSA and met-HSA). 

Clearly, the said "another polypeptide" in the fusion compounds of the invention cannot be the remaining 
portion of HSA, since otherwise the whole polypeptide would be HSA, which would not then be a "fusion poly- 
peptide". 

Even when the HSA-like portion is not the said 1-n portion of HSA, it is preferred for the non-HSA portion 
so to be one of the said (a) to (h) entities. 

The 1 to 368 portion of CD4 represents the first four disulphide-linked immunogiobulin-like domains of the 
human T lymphocyte CD4 protein, the gene for and amino acid sequence of which are disclosed in D. Smith 
et al (1987) Science 328. 1704-1707. It is used to combat HIV infections. 

The sequence of human platelet-derived growth factor (PDGF) is described in Collins etal (1985) Nature 
55 316 , 748-750. Similarly, the sequence of transforming growth factors B (TGF-B) is described in Derynck et al 
(1 985) Nature 316 , 701-705. These growth factors are useful for wound-healing. 

AcDNA sequence for the 1-261 portion of Fn was disclosed inEP-A-207 751 (obtained from plasmid pFH6 
with endonuclease Pvull). This portion binds fibrin and can be used to direct fused compounds to blood dots. 
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A cDNA sequence for Ihe 278-578 portion of Fn, which contains a collagen-binding domain, was disclosed 
by R.J. Owens and F.E. Baralle in 1986 E.M.B.O.J. 5, 2825-2830. This portion will bind to platelets. 

The 1-272 portion of von WDIebrand's Factor binds and stabilises factor VIII. The sequence is given in Bon- 
tham et a], Nud. Acids Res. 14, 7125-7127. 
s Variants of alpha-1-antitrypsin include those disclosed by Rosenburg et a[ (1984) Nature 312, 77-80. In 

particular, the present invention includes the Pittsburgh variant (Met 358 is mutated to Arg) and the variant where 
Pro 357 and Met 35 * are mutated to alanine and arginine respectively. These compounds are useful in the treat- 
ment of septic shock and lung disorders. 

Variants of the non-HSA portion of the polypeptides of the invention include variations as discussed above 
io in relation to the HSA portion, including those with conservative amino acid substitutions, and also homologues 
from other species. 

The fusion polypeptides of the invention may have N-terminal amino acids which extend beyond the por- 
tion corresponding to the N-terminal portion of HSA. For example, if the HSA-like portion corresponds to an 
N-terminal portion of mature HSA, then pre-, pro-, or pre-pro sequences may be added thereto, for example 
t5 the yeast alpha-factor leader sequence. The fused leader portions of WO 90/01063 may be used. The poly- 
peptide which is fused to the HSA portion may be a naturally-occurring polypeptide, a fragment thereof or a 
novel polypeptide, including a fusion polypeptide. For example, in Example 3 below, a fragment of fibronectin 
is fused to the HSA portion via a 4 amino acid linker. 

It has been found that the amino terminal portion of the HSA molecule is so structured as to favour par- 
20 ticularly efficient translocation and export of the fusion compounds of the invention in eukaryotic cells. 

A second aspect of the invention provides a transformed host having a nucleotide sequence so arranged 
as to express a fusion polypeptide as described above. By "so arranged", we mean, for example, that the nu- 
cleotide sequence is in correct reading frame with an appropriate RNA polymerase binding site and translation 
start sequence and is under the control of a suitable promoter. The promoter may be homologous with or het- 
25 erologous to the host. Downstream (3') regulatory sequences may be included if desired, as is known. The 
host is preferably yeast (for example Saccharomyces spp.. e.g. S. cerevisiae ; Kluyveromyces spp., e.g. K. lac- 
tjsj Pichia spp.; or Schizosaccharomyces spp., e.g. S. pombe) but may be any other suitable host such as E. 
coli , B. subtil is , Aspergillus spp., mammalian cells, plant cells or insect cells. 

A third aspect of the invention provides a process for preparing a fusion polypeptide according to the first 
30 aspect of the invention by cultivation of a transformed host according to the second aspect of the invention, 
followed by separation of the fusion polypeptide in a useful form. 

A fourth aspect of the invention provides therapeutic methods of treatment of the human or other animal 
body comprising administration of such a fusion polypeptide. 

In the methods of the invention we are particularly concerned to improve the efficiency of secretion of 
35 useful therapeutic human proteins from yeast and have conceived the idea of fusing to amino-terminal portions 
of HSA those proteins which may ordinarily be only inefficiently secreted. One such protein is a potentially 
valuable wound-healing polypeptide representing amino acids 585 to 1578 of human fibronectin (referred to 
herein as Fn 585-1578). As we have described in a separate application (filed simultaneously herewith) this 
molecule contains cell spreading, chemotactic and chemokinetic activities useful in healing wounds. The fusion 
40 polypeptides of the present invention wherein the C-terminal portion is Fn 585-1578 can be used for wound 
healing applications as biosynthesised, especially where the hybrid human protein will be topically applied. 
However, the portion representing amino acids 585 to 1578 of human fibronectin can if desired be recovered 
from the fusion protein by preceding the first amino acid of the fibronectin portion by amino acids comprising 
a factor X cleavage site. After isolation of the fusion protein from culture supernatant, the desired molecule Is 
45 released by factor X cleavage and purified by suitable chromatography (e.g. ion-exchange chromatography). 
Other sites providing for enzymatic or chemical cleavage can be provided, either by appropriate juxtaposition 
of the N-terminal and C-terminal portions or by the insertion therebetween of an appropriate linker. 

At least some of the fusion polypeptides of the invention, especially those including the said CD4 and vWF 
fragments, PDGF and c^AT, also have an increased half-life in the blood and therefore have advantages and 
so therapeutic utilities themselves, namely the therapeutic utility of the non-HSA portion of the molecule. In the 
case of a,AT and others, the compound will normally be administered as a one-off dose or only a few doses 
over a short period, rather than over a long period, and therefore the compounds are less likely to cause an 
immune response. 

55 EXAMPLES : SUMMARY 

Standard recombinant DNA procedures were as described by Maniatis etal (1982 and recent 2nd edition) 
unless otherwise stated. Construction and analysis of phage M13 recombinant clones was as described by 
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Messing (1983) and Sanger etal (1977). 

DNA sequences encoding portions of human serum albumin used in the construction of the following mol- 
ecules are derived from the plasmids mHOB12 and pDBD2 (EP-A-322 094, Delta Biotechnology Ltd, relevant 
portions of which are reproduced below) or by synthesis of oligonucleotides equivalent to parts of this se- 

5 quence. DNA sequences encoding portions of human fibronectin are derived from the plasmid pFHDELI, or 
by synthesis of oligonucleotides equivalent to parts of this sequence. Plasmid pFHDELI, which contains the 
complete human cDNA encoding plasma fibronectin, was obtained by ligation of DNA derived from plasmids 
pFH6, 16, 54, 154 and 1 (EP-A-207 751; Delta Biotechnology Ltd). 

This DNA represents an mRNA variant which does not contain the 'ED' sequence and had an 89-amino 

to acid variant of the lll-CS region (R.J. Owens, A.R. Kornblihtt and RE. Baralle (1986) Oxford Surveys on Eu- 
karyob'c Genes 3 141-160). The map of this vector is disclosed in Fig. 11 and the protein sequence of the mature 
polypeptide produced by expression of this cDNA is shown in Fig. 5. 

Oligonucleotides were synthesised on an Applied Biosystems 380B oligonucleotide synthesiser according 
to the manufacturer's recommendations (Applied Biosystems, Warrington, Cheshire, UK). 

15 An expression vector was constructed in which DNA encoding the HSA secretion signal and mature HSA 

up to and including the 387th amino acid, leucine, fused in frame to DNA encoding a segment of human fibro- 
nectin representing amino acids 585 to 1578 inclusive, was placed downstream of the hybrid promoter of EP- 
A-258 067 (Delta Biotechnology), which is a highly efficient galactose-inducible promoter functional in Sac- 
charomyces cerevisiae . The codon for the 1578th amino acid of human fibronectin was directly followed by a 

20 stop codon (TAA) and then the S. cerevisiae phosphoglycerate kinase (PGK) gene transcription terminator. 
This vector was then introduced into S. cerevisiae by transformation, wherein it directed the expression and 
secretion from the cells of a hybrid molecule representing the N-terminal 387 amino acids of HSAC-terminally 
fused to amino acids 585 to 1578 of human fibronectin. 

In a second example a similar vector is constructed so as to enable secretion by S. cerevisiae of a hybrid 

25 molecule representing the N-terminal 1 95 amino acids of HSA C-terminally fused to amino acids 585 to 1578 
of human fibronectin. 

Aspects of the present invention will now be described by way of example and with reference to the ac- 
companying drawings, in which: 

Figure 1 (on two sheets) depicts the amino acid sequence currently thought to be the most representative 
30 of natural HSA, with (boxed) the alternative C-termini of HSA(1-n); 

Figure 2 (on two sheets) depicts the DNAsequence coding for mature HSA, wherein the sequence included 
in Linker 3 is underlined; 

Figure 3 illustrates, diagrammatically, the construction of mHOB16; 

Figure 4 illustrates, diagrammatically, the construction of pHOB31; 
35 Figure 5 (on 6 sheets) illustrates the mature protein sequence encoded by the Fn plasmid pFHDELI ; 

Figure 6 illustrates Linker 5, showing the eight constituent oligonucleotides; 

Figure 7 shows schematically the construction of plasmid pDBDF2; 

Figure 8 shows schematically the construction of plasmid pDBDF5; 

Figure 9 shows schematically the construction of plasmid pDBDF9; 
40 Figure 10 shows schematically the construction of plasmid DBDF12, using plasmid pFHDELI ; and 

Figure 11 shows a map of plasmid pFHDELI. 

EXAMPLE 1 : HSA 1-387 FUSED TO Fn 585-1578 

45 The following is an account of a preparation of plasmids comprising sequences encoding a portion of HSA, 

as is disclosed in EP-A-322 094. 

The human serum albumin coding sequence used in the construction of the following molecules is derived 
from the plasmid M13mp19.7 (EP-A-201 239, Delta Biotech- nology Ltd.) or by synthesis of oligonucleotides 
equivalent to parts of this sequence. Oligonucleotides were synthesised using phosphoramidite chemistry on 

50 an Applied Biosystems 380B oligonucleotide synthesizer according to the manufacturer's recommendations 
(AB Inc., Warrington, Cheshire, England). 

An oligonucleotide was synthesised (Linker A) which represented a part of the known HSA coding se- 
quence (Figure 2) from the Pstl site (1235-1240, Figure 2) to the codon for valine 381 wherein that codon was 
changed from GTG to GTC: 

55 
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Linker 1 



D 


P 


H 


B 


C 


Y 


GAT 


CCT 


CAT 


GAA 


TGC 


TAT 


CTA 


GGA 


GTA 


CTT 


ACG 


ATA 



5' 

3' ACGT 

1247 





A 


K 


V 


F 


D 


E 


F 


K 


15 


GCC 


AAA 


GTG 


TTC 


GAT 


GAA 


TTT 


AAA 




CGG 


TTT 


CAC 


AAG 


CTA 


CTT 


AAA 


TTT 


20 


P 

CTT 


L 

GTC 


1267 

V 

3' 












25 


GGA 


CAG 


5' 













Linker 1 was ligated into the vector M13mp19 (Norrander et al . 1983) which had been digested with Pstl 
and Hindi and the ligation mixture was used to transfect E.coli strain XL1-Blue (Stratagene Cloning Systems, 
San Diego, CA). Recombinant clones were identified by their failure to evolve a blue colour on medium con- 
30 taining the chromogenic indicator X-gal (5-bromo-4-chloro-3-indolyl-B-D-galactoside) in the present of IPTG 
(isopropylthio-0-galactoside). DNA sequence analysis cf template DNA prepared from bacteriophage particles 
of recombinant clones identified a molecule with the required DNA sequence, designated mHOB12 (Figure 3). 

M13mp19.7 consists of the coding region of mature HSAin M13mp19 (Norrander et al . 1983) such that 
the codon for the first amino acid of HSA, GAT, overlaps a unique Xhol site thus: 

35 

Asp Ala 

5' CTCGAGATGCA 3' 

40 3' GAGCTCTACGT 5' 

Xho l 

45 (EP-A-210 239). M13mp19.7 was digested with Xhol and made flush-ended by S1 -nuclease treatment and 
was then ligated with the following oligonucleotide (Linker 2): 

Linker 2 



55 



5'TCTTTTATCCAAGCTTGGATAAAAGA 3 
3'AGAAAATAGGTTCGAACCTATTTTCT 5 

Hindlll 



The ligation mix was then used to transfect E.coli XL1-Blue and template DNA was prepared from several 

5 



EP 0 399 666 B1 



plaques and then analysed by DNA sequencing to identify a done, pDBD1 (Figure 4), with the correct se- 
quence. 

A 1.1 kb Hindlll to Pstl fragment representing the 5' end of the HSA coding region and one half of the in- 
serted oligonucleotide linker was isolated from pDBD1 by agarose gel electrophoresis. This fragment was then 

6 iigated with double stranded mHOB12 previously digested with Hindlll and Pstl and the ligation mix was then 
used to transfect E.coli XL1-Blue. Single stranded template DNA was prepared from mature bacteriophage par- 
ticles of several plaques. The DNA was made double stranded in vitro by extension from annealed sequencing 
primer with the Klenow fragment of DNA polymerase I in the presence of deoxynucleoside triphosphates. Re- 
striction enzyme analysis of this DNA permitted the identification of a clone with the correct configuration, 

10 mHOB15 (Figure 4). 

The following oligonucleotide (Linker 3) represents from the codon for the 382nd amino acid of mature HSA 
(glutamate, GAA) to the codon for lysine 389 which is followed by a stop codon (TAA) and a Hindlll site and 
then a BamHI cohesive end: 



15 Linker 3 



EEPQNLIKJ 
20 5 ' GAA GAG CCT CAG AAT TTA ATC AAA TAA GCTTG 3' 

3 ' CTT CTC GGA GTC TTA AAT TAG TTT ATT CGAACCTAG 5 ' 

25 This was Iigated into double stranded mHOB15, previously digested with Hindi and Bam HI. After ligation, 

the DNA was digested with Hindi to destroy all non-recombinant molecules and then used to transfect E.coli 
XL1-Blue. Single stranded DNA was prepared from bacteriophage particles of a number of dones and sub- 
jected to DNA sequence analysis. One clone having the correct DNA sequence was designated mHOB16 (Fig- 
ure 4). 

30 A molecule in which the mature HSA coding region was fused to the HSA secretion signal was created by 

insertion of Linker 4 into Bam HI and Xhol digested M13mp19.7 to form pDBD2 (Figure 4). 

Linker 4 



35 
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V 
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F 




5' 


GATCC ATG 


AAG 


TGG 


GTA 


AGC 


TTT 


40 




G TAC 


TTC 


ACC 


CAT 


TCG 


AAA 


45 


I 


S 


L 


L 


F L 


F 


S 




ATT 


TCC 


CTT 


CTT 


TTT CTC 


TTT 


AGC 




TAA 


AGG 


GAA 


GAA 


AAA GAG 


AAA 


TCG 



60 
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s 


A 


Y 


S 


R 


G 


V 


F 


TCG 


GCT 


TAT 


TCC 


AGG 


GGT 


GTG 


TTT 


AGC 


CGA 


ATA 


AGG 


TCC 


CCA 


CAC 


AAA 



R R 
CG 3' 
GCAGCT 5 ' 

In this linker the codon for the fourth amino acid after the initial methionine, ACC for threonine in the HSA 
pre-pro leader sequence (Lawn et al , 1981), has been changed to AGC for serine to create a HindlH site.. 

A sequence of synthetic DNA representing a part of the known HSA coding sequence (Lawn et al .. 1981) 
(amino acids 382 to 387, Fig. 2), fused to part of the known fibronectin coding sequence (Kornblihtt et al .. 1985) 
(amino acids 585 to 640, Fig. 2), was prepared by synthesising six oligonucleotides (Linker 5, Fig. 6). The oli- 
gonucleotides 2, 3, 4, 6, 7 and 8 were phosphorylated using T4 polynucleotide kinase and then the oligonu- 
cleotides were annealed under standard conditions in pairs, i.e. 1+8, 2+7, 3+6 and 4+5. The annealed oligo- 
nucleotides were then mixed together and ligated with mHOB12 which had previously been digested with the 
restriction enzymes Hindi and EcoR I. The ligation mixture was then used to transfect E.coli XL1-Blue (Stra- 
tagene Cloning Systems, San Diego, CA). Single stranded template DNA was then prepared from mature bac- 
teriophage particles derived from several independent plaques and then was analysed by DNA sequencing. 
Aclone in which a linker of the expected sequence had been correctly inserted into the vector was designated 
pDBDFI (Fig. 7). This plasmid was then digested with Pstl and EcoRI and the approx. 0.24kb fragment was 
purified and then ligated with the 1.29kb Bam HI-Pstl fragment of pDBD2 (Fig. 7) and Bam Hl + EcoRI digested 
pUC19 (Yanisch-Perron, et al ., 1985) to form pDBDF2 (Fig. 7). 

A plasmid containing a DNA sequence encoding full length human fibronectin, pFHDELI, was digested 
with Eco RI and Xhol and a 0.77kb EcoRI-xhol fragment (Fig. 8) was isolated and then ligated with Eco RI and 
sail digested M13 mp18 (Norrander et al. , 1983) to form pDBDF3 (Fig. 8). 

The following oligonucleotide linker (Linker 6) was synthesised, representing from the Pstl site at 4784- 
4791 of the fibronectin sequence of EP-A-207 751 to the codon for tyrosine 1578 (Fig. 5) which is followed by 
a stop codon (TAA), a HindlH site and then a Bam Hl cohesive end: 

Linker 6 



GPDQTEMTIEGL 
GGT CCA GAT CAA ACA GAA ATG ACT ATT GAA GGC TTG 
A CGT CCA GGT CTA GTT TGT CTT TAC TGA TAA CTT CCG AAC 



Q P T V E Y Stop 
CAG CCC ACA GTG GAG TAT TAA GCTTG 

GTC GGG TGT CAC CTC ATA ATT CGAACCTAG 

This linker was then ligated with Pstl and HindlH digested pDBDF3 to form pDBDF4 (Fig. 8). The following 
DNAfragments were then ligated together with Bglll digested pKV50 (EP-A-258 067) as shown in Fig. 8: 0.68kb 
EcoRI- Bam HI fragment of pDBDF4, 1 .5kb Bam HI-StuI fragment of pDBDF2 and the 2.2kb Stul-EcoRI fragment 
of pFHDELI. The resultant plasmid pDBDF5 (Fig. 8) includes the promoter of EP-A-258 067 to direct the ex- 
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pression of the HSA secretion signal fused to DNA encoding amino acids 1-387 of mature HSA, in turn fused 
directly and in frame with DNA encoding amino acids 585-1578 of human fibronectin, after which translation 
would terminate at the stop codon TAA. This is then followed by the S.cerevisiae PGK gene transcription ter- 
minator. The plasmid also contains sequences which permit selection and maintenance in Escherichia coli and 
5 S.cerevisiae (EP-A-258 067). 

This plasmid was introduced into S.cerevisiae S150-2B (leu2-3 I eu2-112 ura3-52 trp1-289 his3- 1) by stan- 
dard procedures (Beggs, 1978). Transformants were subsequently analysed and found to produce the HSA- 
f ibronectin fusion protein. 

10 EXAMPLE 2 : HSA 1-195 FUSED TO Pn 585-1578 

In this second example the first domain of human serum albumin (amino acids 1-195) is fused to amino 
acids 585-1578 of human fibronectin. 

The plasmid pDBD2 was digested with Bam HI and Bglll and the 0.79kb fragment was purified and then 
15 ligated with BamH I-digested M1 3mp19 to form pDBDF6 (Fig. 6). The following oligonucleotide: 

5'-CCAAAGCTCGAGGAACTTC G-3' 

was used as a mutagenic primer to create a Xhol site in pDBDF6 by in vitro mutagenesis using a kit supplied 
20 by Amersham International PLC. This site was created by changing base number 696 of HSA from a T to a G 
(Fig. 2). The plasmid thus formed was designated pDBDF7 (Fig. 9). The following linker was then synthesised 
to represent from this newly created Xhol site to the codon for lysine 195 of HSA (AAA) and then from the 
codon for isoleucine 585 of fibronectin to the ends of oligonucleotides 1 and 8 shown in Fig. 6. 

25 Linker 7 



30 


D 

TC GAT 
A 


E 
GAA 
CTT 


L 
CTT 
GAA 


R 
CGG 
GCC 


D 

GAT 
CTA 


E 

GAA 
CTT 


G 
GGG 
CCC 


K 
AAG 
TTC 


A 
GCT 
CGA 


S 
TCG 
AGC 


S 
TCT 
AGA 


A 
GCC 
CGG 


K 
AAA 
TTT 


35 
40 


I 

ATC 
TAG 


T 
ACT 
TGA 


£ 
GAG 
CTC 


T 
ACT 
TGA 


P 
CCG 
GGC 


S 
AGT 
TCA 


Q 
CAG 
GTC 


P 

C 

GGG 


N 
TTG 


S 
AGG 


H 
GTG 


G 





This linker was ligated with the annealed oligonucleotides shown in Fig. 3, i.e. 2+7, 3+6 and 4+5 together 
with Xho l and EcoRI digested pDBDF7 to form pDBDF8 (Fig. 9). Note that in order to recreate the original 
HSA DNA sequence, and hence amino acid sequence, insertion of linker 7 and the other oligonucleotides into 

45 pDBDF7 does not recreate the Xho l site. 

The 0.83kb Bam Hi-StuI fragment of pDBDF8 was purified and then was ligated with the 0.68kb Eco RI- 
Bam HI fragment of pDBDF2 and the 2.22kb Styl-EcoRI fragment of pFHDELI into Bolll-digested pKV50 to 
form pDBDF9 (Fig. 9). This plasmid is similar to pDBDF5 except that it specifies only residues 1-195 of HSA 
rather than 1-387 as in pDBDF5. 

so When introduced into S.cerevisiae S150-2B as above, the plasmid directed the expression and secretion 

of a hybrid molecule composed of residues 1-195 of HSA fused to residues 585-1578 of fibronectin. 

EXAMPLE 3 : HSA 1-387 FUSED TO Fn 585-1578, AS CLEAVABLE MOLECULE 

55 In order to facilitate production of large amounts of residues 585-1 578 of fibronectin, a construct was made 

in which DNA encoding residues 1-387 of HSA was separated from DNA encoding residues 585-1578 of f ibro- 
nectin by the sequence 



8 



EP 0 399 666 B1 





I 


E 


G 


R 




ATT 


GAA 


GGT 


AGA 


5 


TAA 


CTT 


CCA 


TCT 



which specifies the cleavage recognition site for the blood clotting Factor X. Consequently the purified secreted 
product can be treated with Factor X and then the fibronectin part of the molecule can be separated from the 
10 HSA part. 

To do this two oligonucleotides were synthesised and then annealed to form Linker 8. 
Linker 8 



15 


E 


E 


P 


Q 


N 


L 


I 


E 


G 




GAA 


GAG 


CCT 


CAG 


AAT 


TTA 


ATT 


GAA 


GGT 




CTT 


CTC 


GGA 


GTC 


TTA 


AAT 


TAA 


CTT 


CCA 


20 






















R 


I 


T 


E 


T 


P 


S 


Q 


P 


25 
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TGA 
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N 
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35 
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AGG 


GTG 
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This linker was then ligated with the annealed oligonucleotides shown in Fig. 6, i.e. 2+7, 3+6 and 4+5 into 
Hindi andEcoRI digested mHOBl2, to form pDBDF10 (Fig. 7). The plasmid was then digested with Pstl and 
EcoRI and the roughly 0.24kb fragment was purified and then ligated with the 1 .29kb BamHI-Pstl fragment of 
40 pDBD2 and Bam HI and Eco RI digested pUC19 to form pDBDF11 (Fig. 10). 

The 1.5kb Bam HI-StuI fragment of pDBDF11 was then ligated with the 0.68kb EcoRI -BamHI fragment of 
pDBDF4 and the 2.22kb Stul-EcoRI fragment of pFHDELI into Bglll-digested pKV50 to form pDBDF12 (Fig. 
10). This plasmid was then introduced into S.cerevisiae S150-2B. The purified secreted fusion protein was 
treated with Factor X to liberate the fibronectin fragment representing residues 585-1578 of the native mole- 
45 cule. 
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Claims 

Claims for the following Contracting States : AT, BE, CH, LI, DE, DK, FR, IT, LU, NL, SE 

1. Afusion polypeptide comprising, as at least part of the N-terminal portion thereof, an N-terminal portion 
of HSA or a variant thereof and, as at least part of the C-terminal portion thereof, another polypeptide 
except that, when the said N-termina! portion of HSA is the 1-n portion where n is 369 to 419 or a variant 
thereof then the said polypeptide is (a) the 585 to 1578 portion of human f ibronectin or a variant thereof, 
(b) the 1 to 368 portion of CD4 or a variant thereof, (c) platelet derived growth factor or a variant thereof, 
(d) transforming growth factor B or a variant thereof, (e) the 1-261 portion of mature human plasma fi- 
bronectin or a variant thereof, (f) the 278-578 portion of mature human plasma fibronectin or a variant 
thereof, (g) the 1-272 portion of mature human von Willebrand's Factor or a variant thereof, or (h) alpha- 
1-antitrypsin or a variant thereof. 

2. Afusion polypeptide according to Claim 1 additionally comprising at least one N-terminal amino acid ex- 
tending beyond the portion corresponding to the N-terminal portion of HSA. 

3. A fusion polypeptide according to Claim 1 or 2 wherein there is a cleavable region at the junction of the 
said N-terminal or C-terminal portions. 

4. Afusion polypeptide according to any one of the preceding claims wherein the said C-terminal portion is 
the 585 to 1578 portion of human plasma fibronectin or a variant thereof. 

5. A transformed or transfected host having a nucleotide sequence so arranged as to express a fusion poly- 
peptide according to any one of the preceding claims. 

6. A process for preparing a fusion polypeptide by cultivation of a host according to Claim 5, followed by sep- 
aration of the fusion polypeptide in a useful form. 

7. A fusion polypeptide according to any one of Claims 1 to 4 for use in therapy. 
Claims for the following Contracting States : ES, GR 

1. A process for preparing a fusion polypeptide by (i) cultivation of a transformed or transfected host having 
a nucleotide sequence so arranged as to express a fusion polypeptide, followed by (ii) separation of the 
fusion polypeptide in a useful form, characterised in that the fusion polypeptide comprises as at least part 
of the N-terminal portion thereof, an N-terminal portion of HSA or a variant thereof and, as at least part 
of the C-terminal portion thereof, another polypeptide except that, when the said N-terminal portion of 
HSA is the 1-n portion where n is 369 to 419 or a variant thereof then the said polypeptide is (a) the 585 
to 1578 portion of human fibronectin or a variant thereof, (b) the 1 to 368 portion of CD4 or a variant there- 
of, (c) platelet derived growth factor or a variant thereof, (d) transforming growth factor B or a variant there- 
of, (e) the 1-261 portion of mature human plasma fibronectin or a variant thereof, (f) the 278-578 portion 
of mature human plasma fibronectin or a variant thereof, (g) the 1-272 portion of mature human von Will- 
ebrand's Factor or a variant thereof, or (h) alpha-1 -antitrypsin or a variant thereof. 

2. A process according to Claim 1, wherein the fusion polypeptide additionally comprising at least one N- 
terminal amino acid extending beyond the portion corresponding to the N-terminal portion of HSA 

3. A process according to Claim 1 or 2 wherein, in the fusion polypeptide, there is a cleavable region at the 
junction of the said N-terminal or C-terminal portions. 

4. A process according to any one of the preceding claims wherein the said C-terminal portion is the 585 
to 1578 portion of human plasma fibronectin or a variant thereof. 
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Patentanspruche 

Patentanspruche fOr folgende Vertragsstaaten : AT, BE, CH, DE, DK, FR, IT, LU, NL, SE 

5 1. Fusionspolypeptid, umfassend als mindestens einen Teil seines N-terminalen Teils einen N-terminalen 
Teil von HSA oder eine Variante davon und als mindestens einen Teil seines C-termina!en Teils ein wei- 
teres Polypeptid mit der Ausnahme, da a wenn es sich bei dem N-terminalen Teil von HSA urn den Tell 1- 
n mit n = 369 bis 419 oder eine Variante davon handelt, das Polypeptid aus 
(a) dem Teil 585 bis 1578 von Humanfibronectin oder einer Variante davon, 

10 (b) dem Teil 1 bis 368 von CD4 oder einer Variante davon, 

(c) dem "Platelet Derived Growth Factor" (PDGF) oder einer Variante davon, 

(d) dem "Transforming Growth Factor B" (TGF P) oder einer Variante davon, 

(e) dem TeB 1-261 von reifem Humanplasmaf ibronectin oder einer Variante davon, 

(f) dem Teil 278-578 von reifem Humanplasmaf ibronectin oder einer Variante davon, 

is (g) dem Tefl 1-272 von reifem Human-von Willebrand's-Faktor oder einer Variante davon oder 

(h) Alpha-1 -Antitrypsin Oder einer Variante davon, besteht 

2. Fusionspolypeptid nach Anspruch 1, zusatzlich umfassend mindestens eine N-terminale AminosSure, die 
l§nger als der dem N-terminalen Teil von HSA entsprechende Teil isL 

20 

3. Fusionspolypeptid nach Anspruch 1 Oder 2, bei dem sich an der Verbindung der N-terminalen oder C- 
terminalen Teile eine spaltbare Region befindet. 

4. Fusionspolypeptid nach einem der vorhergehenden Anspruche, wobei der C-terminale Teil aus dem Teil 
25 585 bis 1578 von Humanplasmaf ibronectin oder einer Variante davon besteht. 

5. Transformierter oder transf izier ter Wirt mit einer Nukleotidsequenz, die so angeordnet ist, daB sie ein Fu- 
sionspolypeptid nach einem der vorhergehenden Anspruche exprimieren kann. 

6. Verfahren zur Herstellung eines Fusionspolypeptids durch Kultivieren eines Wirts nach Anspruch 5 und 
30 anschliefiendes Abtrennen des Fusionspolypeptids in einer geeigneten Form. 

7. Fusionspolypeptid nach einem der Anspruche 1 bis 4 zur therapeutischen Verwendung. 
Patentanspruche fur folgende Vertragsstaaten : ES, GR 

35 

1. Verfahren zur Herstellung eines Fusionspolypeptids durch 

(i) Kultivieren eines transformierten oder transfektierten Wirts mit einer Nukleotidsequenz, die so an- 
geordnet ist, daB sie ein Fusionspolypeptid exprimiert, und 

(ii) anschlieBendes Abtrennen des Fusionspolypeptids in einer geeigneten Form, 
40 dadurch gekennzeichnet, daft das Fusionspolypeptid als mindestens einen Teil seines N-terminalen Teils 

einen N-terminalen Teil von HSA Oder eine Variante davon und als mindestens einen Teil seines C-ter- 
minalen Teils ein weiteres Polypeptid umfaBt, mit der Ausnahme, daB wenn es sich bei dem N-terminalen 
Teil von HSA um den Teil 1-n mit n= 369 bis 419 oder eine Variante davon handelt, das Polypeptid aus 
(a) dem Ten 585-1578 von Humanfibronectin oder einer Variante davon, 
45 (b) dem Te0 1-368 von CD4 oder einer Variante davon, 

(c) dem Platelet Derived Growth Factor oder einer Variante davon, 

(d) dem Transforming Growth Factor p oder einer Variante davon, 

(e) dem Teil 1-261 von reifem Humanplasmaf ibronectin oder einer Variante davon, 

(f) dem Teil 278-578 von reifem Humanplasmaf ibronectin oder einer Variante davon. 

so (g) dem Teil 1-272 von reifem Human-von Willebrand's-Faktor oder einer Variante davon oder 

(h) a-1 -Antitrypsin oder einer Variante davon besteht. 

2. Verfahren nach Anspruch 1, wobei das Fusionspolypeptid zusatzlich mindestens eine N-terminale Ami- 
nosaure, die linger als der dem N-terminalen Teil von HSA entsprechende Teil ist, umfaBt 

55 

3. Verfahren nach Anspruch 1 oder 2, wobei sich in dem Fusionspolypeptid an der Verbindung der N-termi- 
nalen oder C-terminalen Teile eine spaltbare Region befindet. 
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4. Verfahren nach einem der vor hergehenden Anspruche, wobei der C-terminale Teil aus dem Tail 585-1578 
von Humanplasmaf ibronectin Oder einer Variante davon besteht. 

Revendications 

Revendications pour les Etats contractants suivants : AT, BE, CH, DE, DK, FR, IT, LU, NL, SE 

1. Polypeptide fusionne comprenant en tant qu'au moins une partie de sa portion N-terminale, une portion 
N-terminale de HSA ou d'un variant de celle-ci et, en tant qu'au moins une partie de sa portion C-termi- 
nale, un autre polypeptide sauf que, iorsque cette portion N-terminale de HSA est la portion 1-n dans la- 
quelle n est 369 a 419 ou un variant de celle-ci, ce polypeptide est (a) la portion 585 a 1578 de la fibro- 
nectine humaine ou un variant de celle-ci, (b) la portion 1 a 368 de CD4 ou un variant de celle-ci, (c) le 
facteur de croissance derive des plaquettes sanguines ou un variant de celui-ci, (d) le facteur de crois- 
sance p de transformation ou un variant de celui-ci, (e) la portion 1-261 de la fibronectine mature de plas- 
ma humain ou un variant de celle-ci, (f) la portion 278-578 de la fibronectine mature de plasma humain 
ou un variant de celle-ci, (g) la portion 1-272 du facteur humain mature de von Wiilebrand ou un variant 
de celle-ci, ou (h) l'alpha-1-antitrypsine ou un variant de celle-ci. 

2. Polypeptide fusionne suivant la revendication 1, comprenant de plus au moins un acide amine N-terminal 
se prolongeant au-dela de la portion correspondent a la portion N-terminale de HSA. 

3. Polypeptide fusionne suivant les revendications 1 ou 2, dans lequel il y a une region susceptible d'etre 
coupee a la jonction de ces portions N-terminale et C-terminale. 

4. Polypeptide fusionne suivant Tune queiconque des revendications precedentes, dans lequel cette portion 
C-terminale est la portion 585 a 1578 de la fibronectine de plasma humain ou un variant de celle-ci. 

5. Hdte transforme ou transfecte ayant une sequence de nucleotides arrangee de facon a exprimer un po- 
lypeptide fusionne suivant Tune queiconque des revendications precedentes. 

6. Precede pour preparer un polypeptide fusionne par culture d'un hfite suivant la revendication 5, suh/ie de 
la separation du polypeptide fusionne sous une forme utile. 

7. Polypeptide fusionne suivant Tune queiconque des revendications 1 a 4 utilisable en therapie. 
Revendications pour les Etats contractants suivants : ES, GR 

1 . Proc6d6 pour preparer un polypeptide fusionne par (i) la culture d'un h6te transforme ou transfecte ayant 
une sequence de nucleotides arrangee de fapon a exprimer un polypeptide fusionne, suivie de (ii) la se- 
paration du polypeptide fusionne sous une forme utilie, caract6ris6 en ce que le polypeptide fusionne 
comprend, en tant qu'au moins une partie de sa portion N-terminale. une portion N-terminale de HSAou 
d'un variant de celle-ci et, en tant qu'au moins une partie de sa portion C-terminale, un autre polypeptide 
sauf que, Iorsque cette portion N-terminale de HSA est la portion 1-n dans laquelle n est 369 a 419 ou 
un variant de celle-ci, ce polypeptide est alors (a) la portion 585 a 1578 de la fibronectine humaine ou un 
variant de celle-ci, (b) la portion 1 a 368 de CD4 ou un variant de celle-ci, (c) le facteur de croissance 
derive des plaquettes sanguines ou un variant de celui-ci, (d) le facteur de croissance B de transformation 
ou un variant de celui-ci, (e) la portion 1-261 de la f ibronectine mature de plasma humain ou un variant 
de celle-ci, (0 la portion 278-578 de la fibronectine mature de plasma humain ou un variant de celle-ci, 
(g) la portion 1-272 du facteur humain mature de von Wiilebrand ou un variant de celle-ci. ou (h) I'alpha- 
1-antitrypsine ou un variant de celle-ci. 

2. Proc6d6 suivant la revendication 1, dans lequel le polypeptide fusionne comprend de plus au moins un 
acide amine N-terminal se prolongeant au-dela de la portion correspondent a la portion N-terminale de 
HSA. 

3. Precede suivant les revendications 1 ou 2 dans lequel, dans le polypeptide fusionne, il y a une region sus- 
ceptible d'etre coupee a la jonction de ces portions N-terminale et C-terminale. 
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4. Proced6 suivant Tune quelconque des revendtcations precedentes, dans lequel cette portion C-terminale 
est la portion 585 a 1578 de la f ibronectine de plasma humain ou un variant de celle-ci. 
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FIGURE I 

1 0 20 
Aso Ala His Lys Ser Giu Val Ala His Arg ?he Lys As? Leu Giy Glu Giu Asn ?he Lys 

30 <0 
Ala Leu val Leu lie Aia Phe Aia Gin Tyr Leu Gin Gin Cys Pro ?he Giu As? His vai 

50 50 
Lys Leu vai Asn Giu Vai Thr Glu Phe Ala Lys Thr Cys Val Ala As? Glu Ser Ala Glu 

70 30 
Asn Cvs as? Lys Ser Leu His Thr Leu Phe Gly As? Lys Leu Cys Thr val Ala Thr Leu 

90 ■ 00 

Arg Giu Thr Tyr Giy Giu Met Ala As? Cys Cys Ala Lys Gin Glu Pro Glu Arg Asn Glu 

ilO »20 
Cys Phe Leu Gin His Lys As? As? Asn Pro Asn Leu Pro Arc Leu val Arg Pro Glu val 

1 30 140 
As? val Met Cys Thr Ala Phe His As? Asn Glu Glu Thr ?he Leu Lys Lys Tyr Leu Tyr 

150 'SO 
Glu lie Ala Arg Arg His Pro Tyr Phe Tyr Aia Pro Glu Leu Leu Phe Phe Ala Lys Arg 

170 ;ao 
Tyr Lys Ala Ala Phe Thr Glu Cys Cys Gin Ala Ala As? Lys Ala Ala Cys Leu Leu Pro 

190 200 
Lys Leu Asp Glu Leu Arg Asp Giu Gly Lys Aia Ser Ser Ala Lys -Gin Arg Leu Lys Cys 

210 220 
Aia Ser Leu Gin. Lys Phe Gly Glu Arg Ala Phe Lys Ala Tr? Ala Vai Aia Arg Leu Ser 

230 240 
Gin Srg ?hs Pro Lys Ala Giu Phe Ala Glu Val Ser Lys Leu Val Thr Asp Leu Thr Lys 

250 2S0 
val His Thr Glu Cys Cys His Giy As? Leu Leu Glu Cys Ala As? As? Arg Ala As? Leu 

270 280 
Ala Lys Tyr lie Cys Giu Asn Gin As? Ser lie Ser Ser Lys Leu Lys Giu Cys Cys Giu 

290 ' 300 

Lys Pro Leu Leu Glu Lys Ser His Cys lie Ala Glu Val Glu Asn Asp Glu Met Pro Ala 

310 320 
As? Lau Pro Ser Leu Ala Aia As? Phe vai Giu Ser Lys As? val Cys Lys Asn Tyr Ala 

330 340 
Giu Ala Lys As? : /ai Phe Leu Giy Met Phe Lau Tyr Glu Tyr Ala Arg Arg His Pro As? 

350' 360 
Tyr Ser Vai Vai Leu Lau Leu Arg Leu Ala Lys Thr Tyr Giu Thr Thr Leu Glu Lys Cys 



I 370 380 
Cys Ala Ala Aia As? Pro His Glu ;Cys Tyr Ala Lys vai Phe As? Glu Phe Lys Pro Leu 
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TIGUHZ I Cant . 

~~ ; ; ; too* 

-/a.i Cxc wj.u Pro G^n Asn Leu . He Lys Gin Asn Cys Glu Leu Phe Glu Gin Leu Civ Giu 



4 10 — — 

'yr Lys ?he "Gin Asn Ala Leu Leu val Arg Tyr Thr Lys Lys Val Pro Gin Val Ser 



420 
Thr 



430 440 
Pro Thr Leu val Glu Val Ser Arg Asn Leu Gly Lys Val Gly Ser Lys Cys Cys Lys His 



450 



Pro Glu Ala Lys Arc Mac ?rc Cys Ala Glu Asp Tyr Leu Ser Val Val Leu Asn Gin L 



460 



470 480 



Cys val Leu Sis Glu Lys Thr Pro Val Ser As? Arg Val Thr Lys Cys Cys Thr Glu Ser 



490 



Leu Val Asn Arg Arg Pro Cys Phe Ser Ala Leu Glu Val Asp Clu Thr Tyr Val Pro L 



500 



510 



Glu Phe Asn Ala Glu Thr. Phe Thr Phe His Ala Asp lie Cys Thr Leu Ser Glu Lys 



530 



S20 
Glu 



540 



Arg Gin lie Lys Lys Gin Thr Ala Leu Val Glu Leu Vai Lys His Lys Pro Lys Ala Thr 



550 560 

ys 



Lys Glu Gin Leu Lys Ala Vai Met Asp Asp Phe Ala Ala Phe Val Glu Lys Cys Cys L- 



570 



580 



Ala Asp Asp Lys Glu Thr Cys Phe Ala Glu Glu Gly Lys Lys Leu Val . Ala Ala Ser Gla 
Ala Ala Leu Giv Leu 
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FIGURE 2 DN'A sequence coding for ma cure HSA 



10 20 30 40 50 60 70 80 

GA7GCACACAAGAG7GAGG77GC7CA7CGG777AAAGA777GGGAGAAGAAAA777CAAAGCC77GG7G7TGA77GCC77 
D A ft X S 2 V A H ?. rXDLGZZNFXALVLIA? 

90 100 110 120 130 140 150 160 

TCC7CAG7A7C7TCAGCAC7G7CCA777GAAGA7CA7G7AAAA77AG7GAATGAAG7AAC7GAA777CCAAAAACA7G7G 
AQYi.QQC?FZDHVXLVNZVTErAX7C 

•70 180 190 200 210 220 230 240 

TTGCTGATGAGTCAGCTGAAAATTGTGACAAATCACTTCATACCCrTTTTGGAGACAAATTATGCACAGTTGCAACTCT? 
VA. DSS AS KCDKS1HTL7G0X 1 C 7 V A TL 

250 250 270 280 290 300 310 320 

CGTGAAACCTATGGTGAA^.TGGC?GACTGCTGTGCAAAACAAGAACCTGAGAGAAATGA.2lTGCTTC7TGCAACACAAAGA 
R Z 7 Y G 2 MADCCAXQEP2RN2C7LQ35XO 

330 340 350 360 370 380 390 400 

TGACAACCCAAACC7CCCCCGA77GG7GAGACCAGACG77GA7G7GA7G7GCAC7GC77T7CA7GACAATGAAGAGACA7 
DNPNLPRLVR7EV0VHCTAFHDHEZT 

410 420 430 440 450 460 470 480 

TTTTGAAAA-^A7ACTTATATGAAATTCCCAGAAGACATCCTTACTTT7ATGCCCCGGAACTCCTTTTCTTTGCTAA.^AGG 
? L X :< i L i Z I A R a H ? Y 7 Y A P Z L L T ? A X 3 

490 500 510 520 530 540 550 560 

TATAAAGCTGCTTTTACAGAArGTTGCCAAGCTGCTGATAAAGCTGCCTGCCTGTTGCCAAAGCTCGATGAACTTCGGGA 
Y X A A J TiCCQAAOKAACLlPXLDZLRD 

570 530 5S0 600 610 620 630 640 

TGAAGGGAAGGCTTCGTCTGCC^AACAGAGACTCAA^TGTGCCAGTCTCCAAAAATTTGGAGAAAGAGCTTTCAAAGCAT 
Z G X A S SAXQRLXCAS L Q X :7 G 2 R A 7 X A 

650 660 670 680 690 700 710 720 

GGGCAG TGGCTC GC CT G AGCCA GAG A777CCCAAAGC7G AG777GCAG AAG7T7CCAAG77AG7G ACAGA7C7 T ACCAAA 
tf A V A R L SQRFPXAEFA 2 V ' S X LVTDL7K 

730 740 750 760 770 780 790 800 

G7CCACACGGAATGC7GCCATGGAGA7C7GC77GAATG7GC7GA7GACAGGGCGGACC77GCCAAC7A7A7C7C7C-AAAA 
V£T2CCHGDLLZCADDRA0LAX?IC-.£N 

810 320 830 840 850 660 870 380 

7CAGGA77CGA7C7CCAG7AAACTGAAGGAA7GC7G7GAAAAACC7CTG7TGGAAAAA7CCCACTGCA77GCCGAAG7GG 
QDSiSSXLXZCCZXPLLZXSnCIAZV 

£90 900 910 920 .930 940 950 560 

AAAA7GA7GAGA7GCC7GC7GACT7GCC77CA77AGC7GC7GA7777G77GAAJVG7AAGGA7G777GCAAAAAC7A7GC7 
SNDZMPADLPSLAAOFVZSXDVCXNYA 

970 980 990 1000 1010 1020 1030 1040 

GAGGCAAAGGA7G7C77CC7GGGCA7G77777G7A7GAA7A7GCAAGAACGCATCC7GAT7AC7C7G7CG7GC7GC7GC7 
I A X D V 7 I G M F L Y I Y A S R H ? D Y S V V L L L 
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FIGURE 2 Conn . 

1050 1060 1070 1060 1090 1 1 00 J.1JJ3 1 120 

GAGACTTGCCAAGACATArGAAACCACTCTAGAGAACTGwGTGCCGCTGCAGATCCTCATGAATGCTATGCCAAAGTG? 
?. LAXTYZTTLIXCCAAADPHECYAKV 

1130 1140 5 150 1 160 1170 1180 1190 1200 

TCGATGAArTTAAACCTCTTGTG GAAGAGCCrCAGAATTrAAr CAAACAAAACTGTGAGCTTTrTGAGCAGCTTCGAGAG 
PDS7XPLV *~Z Z ? Q N L~~x XQNCZLrZQLGE 

1210 1220 1230 12 40 1250 1260 1270 1280 

TACAAATrCCAGAATGCCCTATTAGTTCCTTACACCAAGAAACTACCCCAAGTGTCAACTCCAACTCTTGTACAGGTCTC 
YKFQNALLVHYTKXVPQVSTPTLVZV S 

1290 1300 1310 1320 1330 1340 1350 1360 

.^GAAACCTAGGAAAAGTGGGCACCAAATGTTGTAAACATCCTGAAGCAAAAAGAATGCCCTCTGCAGAAGACTATCTAT 
RN1GXVGSKCCXH?£AKRMPCASDYL 

13"0 1360 1390 1400 1410 1420 1430 1440 

CCGTGGTCG7GAACCAG7TATGTGTGTTGCATGAGAAAACGCCAGTAAG7GACAGAGTCACAAAATGCTGCACAGAGTCC 
SVVLNQLCVLHZKTPVSD3VTXCCTSS 

1450 1460 1470 1480 1490 1500 1510 1520 

TTGGTGAACAGGCGACCATGCTTrrCAGCTCTGGAAGTCGATCAAACATACGTTCCCAAAGAGTTTAATGCTGAAACATT 
LVNRRPCFSALEVDETTVPKZPNAET? 

1530 1540 1550 1560 1570 1580 t590 1600 

CACCTTCCATGCAGATATATGCACACTTTCTGAGAAGGAGAGACAAATCA^GAAACAAACTGCACTTGTTGAGCTTGTGA 
T7KAD ICTLSZXZRQIXKQTALVZ1V 

1610 1620 1630 1540 1650 1660 "670 1S30 

AACAC>JVGCCCAAGGCAACAAAAGAGCAACTGAAAGCTGTTATGGATGATTTCGCAGCTTTTGTAGAGAAGTGCTGCAAG 
KHXPXATXZQLXAVMODFAArVZXCCX 

1690 1700 1710 1720 1730 1740 1750 1760 

GCTGACGATAAGGAGACCTGCTTTGCCGAGGAGGGTAAAAAACT7GTTGCTGCAAGTCAAGCTGCCTTAGGCTTATAACA 
ADDXZTC7AZZ CXKLVAAS QAALGL 

1770 17S0 
TCTACATTTAAAAGCATCTCAG 
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GAAGAGCCTCAGAATTTAATCACTGAGACTCCGAGTCAGCCCAACTCCCACCCCATCCAGTGG 
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Figure 6 Linker 5 showing the eight constituent oligonucleotides 
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Fjg, 7 Construction of pDBDF2 
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Fig. 8 Construction of pDBDF5 
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• 9 Construction of pDBDF9 
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. 10 Construction of pDBDF12 
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Figure II 



Name; pFHDEL 1 

Vector: pUCl8 Amp'v 2560bp 

Insert; hFNcDNA ~ 7630bp 
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